Automatic Parallelization - New Approaches to Code Generation, Data Distribution, and Performance Prediction
نویسنده
چکیده
This paper introduces the Weight Finder, an advanced profiler for Fortran programs,which is based on a von Neumann architecture. Existing Fortran codes are generally too large toanalyze fully in depth with respect to performance tuning. It is the responsibility of the WeightFinder to detect the most important regions of code in the program, as far as execution time isconcerned. Program transformation systems, compilers and users may then subsequently con-centrate their optimization efforts upon these areas in code.Furthermore program unknowns, such as loop iteration counts, true ratios and frequency infor-mation, are derived. Analysis and prediction systems require concrete values for these unknownsin order to provide reasonable accurate results.Animation, simulation, debugging and trace based tools may use the profile data as beingderived by the Weight Finder in order to detect program parts which are never executed, simulatethe program’s control flow, etc.This tool is based on an optimized instrumented profile run. Several optimizations are shownwhich eliminate large portions of the instrumentation code, thus decreasing profile run-time andmemory requirements, and improving the measurement accuracy. It is shown how the profile datacan be adapted for program transformations to the original Fortran program without redoing theprofile run.The Weight Finder is a 10000 line software package which is fully integrated under the ViennaFortran Compilation System.
منابع مشابه
ADAPT: Automated De-Coupled Adaptive Program Transformation
Dynamic program optimization o ers performance improvements far beyond those possible with traditional compile-time optimization [1, 2, 3, 4]. These gains are due to the ability to exploit both architectural and input data set characteristics that are unknown prior to execution time. In this paper, we propose a novel framework for dynamic program optimization, ADAPT (Automated De-coupled Adapti...
متن کاملFinal Report : Compiler - Driven Performance Optimization and Tuning for Multicore Architectures Report Title
Final Report: Compiler-Driven Performance Optimization and Tuning for Multicore Architectures Report Title The widespread emergence of multicore processors as the computing engine in all commodity platforms presents our field with an enormous software development crisis. For over two decades, sequential software applications have enjoyed the free-ride of performance improvement with each new pr...
متن کاملOptimizing Cost Function in Imperialist Competitive Algorithm for Path Coverage Problem in Software Testing
Search-based optimization methods have been used for software engineering activities such as software testing. In the field of software testing, search-based test data generation refers to application of meta-heuristic optimization methods to generate test data that cover the code space of a program. Automatic test data generation that can cover all the paths of software is known as a major cha...
متن کاملPerformance Evaluation of Parallel Simd
A simulator for SIMD type architectures is presented. Starting from an architecture independent algorithm description based on recurrence equations, transformation steps for automatic parallelization, mapping and code generation are outlined. The nal pseudo code program together with architecture dependent parameters and execution time tables, are fed into the simulator in order to gain perform...
متن کاملAutomatic Resource-Constrained Static Task Parallelization : A Generic Approach. (Parallélisation automatique et statique de tâches sous contraintes de ressources : une approche générique)
This thesis intends to show how to efficiently exploit the parallelism present in applications in order to enjoy the performance benefits that multiprocessors can provide, using a new automatic task parallelization methodology for compilers. The key characteristics we focus on are resource constraints and static scheduling. This methodology includes the techniques required to decompose applicat...
متن کامل